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ABSTRACT 

Subhalo abundance matching (SHAM) inserts galaxies into dark matter only simulations of 
the growth of cosmic structure in a way that requires minimal assumptions about galaxy for- 
mation. A galaxy is placed at the potential minimum of each distinct self-bound subhalo with 
a luminosity which is a monotonically increasing function of the maximum mass (or circu- 
lar velocity) attained over the subhalo's earlier history. Galaxy and subhalo properties are 
linked by matching model and observed luminosity functions. Simulated structures can then 
be compared in detail with observation, for example, through galaxy correlation statistics, 
group catalogues, or galaxy-galaxy lensing. Physically meaningful results require the abun- 
dance and clustering of suhalos to be unaffected by numerical resolution. Here we use the 
Millennium and Millennium-II simulations with subhalos defined using the SUBFIND algo- 
rithm to explore the limitations imposed by this requirement. Correlation statistics on scales 
between 200 kpc and 2 Mpc converge to within 20% only for subhalos with maximum past 
masses corresponding to 1000 simulation particles or more. This is substantially above the 
mass limit to which results are quoted in recent SHAM analyses. Detailed galaxy formation 
simulations based on semi-analytic techniques converge to much lower mass, because they 
follow galaxies even after their associated subhalos have been tidally disrupted. 

Key words: cosmology: theory - cosmology: dark matter mass function - galaxies: luminos- 
ity function, stellar mass function - galaxies: haloes - methods: N-body simulations 



1 INTRODUCTION 

The ACDM model has proved consistent with observed cos- 
mic structure over a wide range of scales and epochs, for ex- 
ample, the fluctuatio ns in the cosmic microwave background 
jDunklevetal1l2009h. th e Lym an a forest absorption in quasar 
spectra ^McDonald et alJ l2006h and the large-scale clustering of 
galaxies flPercival et alj|2010l) . In this scenario, small halos form 
first and then progressively accrete and merge to form larger 
systems, a process which is quite well des c ribed by the sim- 
ple excursi on set model of [P ress & Schechter dl974 and its later 
extensions teond et all 1 199 it iBowerl Il99ll : iLacev & Coldll993l : 
IShem&Torrnenlll999h . Current N-body simulations can follow 
dark halo growth at high resolution and in its full cosmological 
conte xt for objects ra nging from halos too small to host galaxies 
(e.g. lGaoetal.ll2004h t o tho s e hosting the most massive galaxy 
clusters (e.g. iGao et af]|2012t lAngulo et ai1l2012l) . While there is 
good agreement on the structure predicted in the absence of bary- 
onic effects, this agreement evaporates when such effects are in- 
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eluded, and there is little consensus on the properties of the galaxy 
which s hould form in any part icular halo (see, for example, the re- 
view of iFrenk & White! I2OI2I) . This is because galaxy formation 
involves complex and poorly understood processes linking a very 
wide range of scales, in particular, star and black hole formation 
and the strong feedback processes which they generate. 

In the standard scenario, galaxies form as gas cools 
and condens es at the cen t res of the evolving population of 
dark halos l lWhite & Reesl 1 1978h . The most direct way to 
study galaxy fo rmation in a ACDM c osmology is v i a full 
simulation (e.g. iNayarro & White! Il994t ICen & Ostrikerl l2000t 
Springel & Hernquist 2003]; iKeres et al.l |2005t IPfrommer et al.l 
l2006t iGovernato et alj|2012t iGuedes et alj|2012h . Such hydrody- 
namic simulations follow the gas dynamics in considerable de- 
tail, but they are expensive in terms of both memory and CPU 
time, and their results depen d strongly on the recip es adopted 
for unresolved processes (e.g. IScannapieco et alj|2012|) . An alter- 
native is provided by so-called semi-analytic models which em- 
ploy simple, physically a nd observationally motivated prescrip- 
tions for baryonic physi cs l lWhite &~F renk 1991; Kauffman n et al.l 
1 19931 : ICole et alJ 1 1994) . These can be implemented on halo or 
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subhalo merger trees extracted from dark-matter-only simulations 
and are useful for exploring evolution of th e galaxy population 
dKauffmann et alj 1 19991: ISpringel et alj l200ll: lHatton et alj [20031 : 
; Be 



iKang et alj 120051 ; bower et alj f2006l ; TGuo et alj|201ll) . Many ob- 
servables can be predicted rapidly for large number of galaxies, 
enabling a detailed check of a model's ability to reproduce the 
abundance, clustering and intrinsic property distributions of galax- 
ies over the full range of observed redshifts, while simultane- 
ously estimating the efficiency of each of the processes involved 
dGuoetalJl201ll : iHenriques etai]|2012l) . Such simulations are par- 
ticularly useful for calibrati ng and interpreting large observational 
surve y s (e.g. IScoville et al. 2007; Coil et al. 2008; Meneux et al. 

Adami et al, 
20121 : 



2008; Guzzoetal. 2008; McCracken et al. 2010 



201C ; Ide la Torre et alj 12011k iKnobel et alj I2OI2I: 



Li et al. 



Planck C ollaborat ion et al Their main limitations arise from 



the simplicity of the individual recipes adopted, and from the very 
limited information they provide about the internal structure of 
galaxies. 

In recent years, subhalo abundance matching (SHAM) has 
become a popular method for modeling the la r ge-scale dis- 
tribut i on of galaxies (e 



Vale & Ostriker 2004; 



20061: IShankar et alj|2006l: iBaldrv et a l. 2008; M osteret al.ll20ld : 



Guo et alfcoToT Behroozi et al Jl2010l : IWake et alj201 lh . The 



idea is to place galaxies in dark matter subhalos assuming a mono- 
tonic relation between the stellar mass (or luminosity) of the galaxy, 
and the maximum mass (or maximum circular velocity) ever at- 
tained by the subhalo during its earlier history. This is a strong as- 
sumption which is almost certainly incorrect in detail - for exam- 
ple, the properties of the galaxy are likely to depend on the redshift 
at which its subhalo attains i ts maximum mass as well as on the 
mass itself I Wa ng et al. I l2006l) - but the method is convenient be- 
cause it requires minimal assumptions and avoids the need to look 
into the physics of galaxy formation. A consequence, of course, is 
that it gives relatively little information about galaxy formation. For 
example, the stellar mass function of galaxies is reproduced by con- 
struction, rather than constraining the efficiency of star and black 
hole formation and of the associated feedback. Galaxy formation 
information can be backed out by requiring the stellar mass-halo 
mass relations implied by data at different redshifts to be consis- 
tent with the growth of structure expected in th e ACDM model 
(e.g. lConrov & Wechsler|[2009l; iMoster et alj|2013l) . Validating the 
SHAM method requires comparisons with i ndependent observa- 
tions, e.g., galaxy correlation functions (e.g. Conrov et alj|2006l : 
IMoster et alj 201(3; Truiillo-Gomez et alj 201 1 ). group catalogues 



(e.g. iMore et alj|201 ll ; iHearin et al. 201^ ), galaxy-galaxy lensing 
and satellite abundances (e.g. Guo et al] l2010l : iLeauthaud et alj 
l2012l;lReddick et al.l2012l) . Overall the agreement is good, suggest- 
ing that the fundamental assumption is not seriously in error. 

Dark halos are tidally stripped very rapidly once they merge 
into larger systems, and any particular simulation may lose 
track of a subhalo before the galaxy it contains is disrupted or 
merges with the central object of its new halo. Subhalo abun- 
dances, hence SHAM results, are thus likely to be affected by 
numerical reso lution. Here we stud y this issue using the Mil- 
lennium (MS. ISpringel et alj 120051) and Millennium-II (MS-IL 
iBovlan-Kolchin et al J 1200917 simulations, two very large dark- 
matter-only simulations of structure growth which differ in mass 
resolution by a factor of 125. We summarize the main features of 
these simulations in Sec. 12.11 Since we will compare the numeri- 
cal convergence of clustering statistics between SHAM and semi- 
analytic galaxy catalogues, in Sec . |2~2l we briefly describe the semi- 
analytic model IGuo et al. I d201lh which implemented on the two 



simulations and which we will use here. In Sec B.ll we study how 
the abundance of subhalos as a function of maximum past subhalo 
mass and of maximum past subhalo circular velocity is affected 
by numerical resolution, while in Sec. 13.21 we show how resolu- 
tion affects the small-scale clustering of subhalos of given abun- 
dance. Sec. |3.3| then contrasts the rather poor convergence we find 
for small-scale clustering in SHAM catalogues, with the much bet- 
ter behaviour seen for the corresponding statistics in the catalogues 
based on semi-analytic simulations. Sec. [4] summarizes our main 
results. 



2 SIMULATIONS 

In this section, we summarize the main features of the two simula- 
tions used in our analysis, and we briefly describe the semi-analytic 
models used to generate the simulated galaxy catalogues we com- 
pare with our SHAM catalogues. 



Conrov et alj 2.1 The MS and MS-II 



This analysis is based on two cosmological simulations, the Mil- 
lennium and the Millennium-II. Both assume a ACDM cosmol- 
ogy with parameters based on a combination of data from the 
2dFGRS dColless & et alj|200ll) and the first-year WMAP release 
dSpergel et al.l2003l) : il m = 0.25, ft A = 0.75, U b = 0.045, n = 1, <r 8 
= 0.9 and h = 0.73. These parameters are at best marg inally consis- 
tent w ith the most recent nine-year WMAP results) Hinshaw et al. 
120121) . However, this is of no consequence for the analysis we 
present in this paper. 

The MS and the MS-II were carried in periodic volumes of 
side 500h _1 Mpc and 100/i~ 1 Mpc, respectively. Both simulations 
used 2160 3 particles to follow the dark matter distribution from 
redshift 127 to the present day. There are 64 and 68 snapshots for 
the MS and the MS-II respectively, with the last 60 being identi- 
cal in the two simulations. At each output time, friends-of-friends 
(FOF) groups were identified b y linking particles with separation 
less than 0. 2 of the mean value fcavis et alJI 19*851) . The SUBFIND 
algorithm dSpringelet ail 1200 ll) was then applied to identify all 
self-bound subhalos within each FOF halo, and every subhalo was 
linked to a unique descendent at the next later output time in order 
to construct merger trees describing the complete assembly history 
of each 2 = subhalo. Halo/subhalo data were stored only for ob- 
jects containing at least 20 particles, so that the smallest resolved 
subhalo in the MS has mass 2.4x10 10 Mq, while the correspond- 
ing resoluti on limit in the MS-II is 1.9xlO s Mp). The readers are 
referred to ISpringel et ail d2005l) and IBovlan-Kolchin et all 1 2009) 
for more detailed descriptions of the two simulations. 

In each FOF group, the most massive self-bound subhalo is 
referred to as the "main subhalo" or "halo". The center of the main 
subhalo is taken to be the gravitational potential minimum of the 
FOF group. The virial radius R V i r of the main subhalo is defined as 
the maximum radius within which the mean density is 200 times the 
critical value. The total mass enclosed is then defined as the virial 
mass of the group, M V i r . Within the virial radius, the maximum 
circular velocity is defined as 



Vm 



(1) 



Vinax is a useful indicator of potential well depth for subhalos, since 
it is less affected by stripping and boundary definition effects than 
subhalo mass. 
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In the standard galaxy formation scenario, galaxy properties 
are expected to be closely related to the mass and potential well 
depth of the halo in which they form. When a halo falls into a more 
massive system, it progressively loses its outer regions until it is 
eventually destroyed. The galaxies, however, are much more com- 
pact, and so are little affected by tides until the final stage of dis- 
ruption/merging. The baryonic properties of satellite galaxies are 
thus expected to be more closely related to the properties of their 
subhalo at the time of infall than to its current properties. In this 
paper, we follow the standard procedure in recent SHAM studies, 
and characterise subhalos by their mass or maximum circular ve- 
locity at infall (defined as the latest time when they were the main 
subhalo of their FOF group) when estimating their abundance for 
matching purposes. For the main subhalo of each group this is the 
current value of these quantities. In both cases, however, we refer 
to the quantities as A/ max and V m ax, respectively, even though the 
past maximum mass of a satellite subhalo does not always occur at 
the time of infall. 



2.2 Semi-analytic Models 

The semi-analytic simulation technique populates halos with galax- 
ies by applying a set of physically or observationally motivated 
recipes to merger trees extracted from N-body simulations. In this 
pa per we wil l use galaxy catalogues from the semi-analytic model 
of Iguo et al.l feOllh which was implemented simultaneously on 
the same MS and MS-II subhalo catalogues which we use for our 
SHAM modelling. This semi-analytic model follows the reioniza- 
tion, infall, shock-heating, radiative cooling and condensation of 
diffuse gas onto galactic disks; the star formation within these disks 
and the associated metal production and wind generation; disk in- 
stabilities and galaxy mergers with their associated starbursts, for- 
mation of bulges, and formation and fuelling of central black holes; 
and the feedback from these black holes into the surrounding hot 
gas atmospheres. The model successfully reproduces the proper- 
ties, abundances and clustering of present-day galaxies ranging 
from dwarf spheroidals to giant cDs. In this model all subhalos 
have a galaxy at their centre but not all galaxies are associated 
with a subhalo. When a subhalo is tidally disrupted, a timescale 
for orbital decay is calculated from the position of the subhalo just 
prior to disruption and from the masses of the subhalo, of its central 
galaxy and of the main halo within which it orbits. The position of 
the associated "orphan" galaxy relative to main halo centre is then 
taken to be that of the most bound particle of the pre-disruption 
halo, shrunk according to a simple dynamical friction model so that 
a merger occurs one dynami cal friction time a fter subhalo disrup- 
tion. Details may be found in lGuo et al 



3 RESULTS 

3.1 Halo/subhalo abundance 

iBovl an-Kol chin et ID (Hi) tested for numerical converence be- 
tween the MS and the MS-II by comparing a number of standard 
quantitative measures of structure in the mass distributions of the 
two simulations. In particular, they calculated and compared mass 
autocorrelation functions at redshifts z — 0,1,2 and 6 finding 
agreement to better than about 10% on physical scales between 
20/i _1 kpc and 1.0/i _1 Mpc and to better than 20% out to a co- 
moving scale of 10/i _1 Mpc. They also compared FOF halo mass 
functions at z — and z = 6 again finding agreement to within 
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Figure 1. Upper panels: Cumulative abundance of subhalos as a function 
of their maximum past mass (actually their mass at infall). Red and green 
curves are results from the MS and the MSII respectively. Lower panels: 
ratios of the MS and MS-II abundances shown in the upper panels. Left 
panels are for 2 = and right for z = 1. In all panels, solid curves denote 
the full subhalo abundance, while dashed and dot-dashed curves separate 
out the contributions from satellite subhalos and main subhalos respectively. 



about 10% above a mass of 4 xl0 10 M Q (around the minimum 
halo mass resolved in the MS). A similar test was carried out by 
lAngulo et alj J2012h who in addition compared results with the 
much larger Millennium-XXL simulation. 

Both these statistics depend primarily on the abundance, in- 
ternal structure and spatial distribution of halos and are relatively 
insensitive to substructure. This explains why good results are ob- 
tained down to masses corresponding to only a few tens of par- 
ticles. Much larger resolution effects are found for statistics which 
depend on the properties of subhalos. Fig.[T]illustrates this point for 
the particular case of the subhalo abundance as a function of past 
maximum mass, a statistic often used in subhalo abundance match- 
ing (SHAM) analyses. The overall abundances are shown by solid 
curves, with the contributions from main subhalos and satellite sub- 
halos indicated by dot-dashed and dashed curves respectively. The 
main subhalo mass functions agree to within 10% between the two 
simulations for masses above a few 10 10 Mq, just as for FoF halo 
masses, but differences of this order appear for satellite subhalos al- 
ready at masses one hundred times greater. As a result, the overall 
subhalo mass function of the MS agrees with that of the MS-II to 
within 10% only above 10 12 M Q at z = 0, and above 2 x 10 u M o 
at z = 1. Thus, SHAM assignment of galaxies to simulations will 
only give numerically converged results at the 10% level for maxi- 
mum past masses corresponding to ~ 10 3 particles or more. 

SHAM analyses often consider subhalo abundance as a func- 
tion of Vmax rather than Af max , since Vmax is a useful indicator of 
subhalo potential well depth and is less sensitive to the details of 
subhalo identification than M max - Fig. [2] is identical in format to 
Fig.[T]except that abundances are shown as functions of V ma x. The 
change in parametrisation has little effect on the convergence be- 
haviour. The abundances of main subhalos agree within 10% for 
V m ax values above about 80 km/s, but similar agreement is ob- 
tained for satellite subhalos only for V ma x > 310 km/s. As a result 
a SHAM analysis based on the MS is converged at the 10% level 
only for galaxies with rotation velocities similar to the Milky Way. 
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Figure 2. Similar to Fig. 1 but for subhalo abundances as a function of the 
maximum past value of peak circular velocity V max . 



3.2 Subhalo correlations 

As the last subsection shows, differences in subhalo abundance be- 
tween the two Millennium Simulations are primarily a reflection of 
lack of convergence in the properties of low-mass satellite subha- 
los. As a result, one may expect the small-scale clustering of sub- 
halos to show even more serious disagreement. This is explored in 
Fig. [3] which compares autocorrelation functions for mass-limited 
subhalo samples chosen to have exactly matched abundances in the 
two simulations. The four panels in each set correspond to abun- 
dances differing by factors of three, as noted by the black label 
in each panel. The red and green labels indicate the lower limits 
on maximum past mass for the MS and MS-II respectively, with 
the same colours used for the corresponding curves. The left set of 
panels shows results for z = while the right set shows results 
for z = 1. We use this particular matching scheme in order to in- 
dicate the expected disagreement in galaxy correlations if the two 
simulations are populated with galaxies using SHAM. 

The highest subhalo abundance considered (n = 3 x 
10 -2 Mpc -3 ) corresponds to a lower limit on maximum past halo 
mass of about 5 x 10 10 Mq both at z — and at z = 1. This is 
about the mass of the halo thought to host the Small Magellanic 
Cloud (e.g. lGuoetal1l2010h . To this limit the clustering amplitude 
of subhalos at z = is lower in the MS than in the MS-II by 20% 
on large scales, by a factor of two at r p — 2 Mpc, and by a factor 
of about four on 100 kpc scales. The discrepancy is similar on large 
scales at z = 1, but only about half as big on Mpc scales and be- 
low. For sparser samples, corresponding to higher minimum values 
of Mmax, the agreement improves, but only for the sparsest sample 
(n = 10~ 3 Mpc~ 3 , M max > 2 x 1O 12 M ) is it better than 20% 
on all but the smallest scales. Thus at the resolution of the MS, the 
SHAM procedure gives "precision" results for present-day cluster- 
ing only for subhalos for which A/ max is greater than about 10 3 
times the mass of a simulation particle, corresponding to galaxies 
more massive than the Milky Way. At z = 1 the convergence is 
already good for M max values which are three times smaller. 

As shown in Fig. [4] very similar discrepancies are seen if sub- 
halo samples are defined above a limiting value of Kn ax rather than 
M max . Indeed, comparison with Fig.|3]shows the agreement to be 
significantly worse, both at z — and at z — 1, when this suppos- 
edly better proxy for galaxy mass is used for abundance matching 
in the two simulations. At z = the ratio reaches a factor of eight at 
a separation of 200 kpc for the highest density samples. The large- 
scale discrepancy is also bigger than in samples selected by M max . 



This is because at given mass (sub)halos formed at higher redshift 
are more compact and hence have higher Vmax- Thus, given that 
satellites typically assembled earlier than centrals of similar V max 
or M max , ranking by the former rather than the latter increases the 
ratio of satellite to central subhalos at each abundance. 

3.3 Autocorrelations for SA galaxies 

Our semi-analytic model follows satellite galaxies from the time 
their associated subhalos are disrupted until the galaxies them- 
selves are tidally destroyed, or they merge with a more massive sys- 
tem. If the orbits of such orphan galaxies are modelled adequately, 
the galaxy population should be less subject to convergence prob- 
lems of the kind discuss ed above than th e subhalo population we 
have focussed on so far. iGuo et all ( 1201 ll) show that galaxy abun- 
dances agree to better than 10% between the MS and the MS-II 
down to stellar masses of about 2.5 x 10 9 Mq, corresponding to 
n ~ 1.2 x 10 _2 Mpc -3 . In addition, at given stellar mass, they 
show galaxy autocorrelations to agree to similar accuracy over the 
full separation range between 100 kpc to 10 Mpc. While clearly 
encouraging, these convergence tests are not directly comparable 
to those we presented above. We have therefore created matched, 
stellar-mass-limited samples of galaxies from the MS and MS-II 
both at z — and at z — 1 and we compare their clustering exactly 
as we did for abundance-matched subhalo samples in Figs.|3]and|4] 
The results are shown in Fig. [5] A first point to note is that, ex- 
cept for the densest samples at z = 0, the stellar mass limits for the 
two samples in each panel never differ by more than about 5%. This 
again emphasises how well the stellar mass functions agree in the 
two simulations, much better than the corresponding subhalo abun- 
dance functions n(M max ) and n(Kn ax ) (e.g. compare with the dif- 
ferences in threshold listed in the panels of Figs[3]and|4j- The most 
significant point, however, is that, apart from some noise effects on 
small scales, the autocorrelation functions in the two simulations 
agree almost perfectly in all panels across the full range of scales 
plotted. This is a strikingly different situation from those shown 
in Figs [3] and [4] for the abundance-matched subhalo samples. The 
relatively simple orphan-tracking methods used in the SA models 
extend the range of scales over which numerically converged re- 
sults for galaxy clustering can be obtained by about two orders of 
magnitude in subhalo mass. 



4 DISCUSSION 

Subhalo abundance matching (SHAM) aims to provide a direct, 
physical and relatively assumption-free scheme for using large cos- 
mological N-body simulations to interpret observational surveys of 
galaxy clustering and large-scale structure. Its principal assumption 
is that an observable property of the galaxies (for example their r- 
band luminosity or their stellar mass) can, to sufficient accuracy, 
be assumed to be a monotonic function of some dark halo property 
(for example the past maximum of mass or peak circular velocity) 
which can be reliably measured for subhalos identified in an N- 
body simulation. Abundance matching then allows subhalos in the 
simulation to be populated with galaxies with known values of the 
observable. 

Regardless of whether this procedure is an adequate repre- 
sentation of the galaxy formation process, it will only predict nu- 
merically converged properties for the galaxy population on scales 
where the abundance of subhalos as a function of the chosen prop- 
erty is independent of parameters such as simulation size and res- 
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Figure 3. Two-point autocorrelation functions for subhalo samples defined above various lower limits on Af max , the maximum past subhalo mass. As in Fig. 
1, red curves are for MS samples, green curves for MS-II samples. The left set of panels shows data for z = while the right set are for z = 1. Each panel 
compares results for samples with the abundance shown in black in its upper right corner. The corresponding lower limits in M max are also shown, with 
colour indicating the relevant simulation. Note that these mass limits do not agree precisely beause of the lack of convergence seen in Fig. [T] 
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Figure 4. Similar to Fig. [5] but here for samples selected above a threshold in V max , the maximum value of maximum circular velocity attained over each 
subhalo's history. Coloured labels in each panel now show the values of Vmax at threshold which correspond to the chosen abundance in each of the two 
simulations. They do not agree because of the lack of convergence seen in Fig. [2] 



olution. To investigate this issue, we compared subhalo catalogues 
constructed using the SUBFIND algorithm from the MS and MS-II, 
two simulations differing in mass resolution by a factor of 125. We 
found subhalo abundances to agree to better than 10% only for sub- 
halos with maximum past masses corresponding to 10 3 particles or 
more in the MS. Discrepancies arise at lower mass because satel- 
lite subhalos are tidally disrupted too early in the lower resolution 
simulation. This has a particularly marked effect on the small-scale 



clustering of subhalos, and SHAM predictions for galaxy autocor- 
relation functions at separations of 1.0 Mpc or less can be in error 
by factors of two or more if they use subhalos significantly less 
massive than the thousand particle limit. 

Using the maximum past value of peak circular velocity 
(V m ax) rather than of mass (M max ) to characterize subhaloes in the 
SHAM procedure makes numerical convergence problems worse 
rather than better. This is because at given abundance the frac- 
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Figure 5. Similar to Fig. [5] but for semi-analytic galaxy samples selected above thresholds in stellar mass. Coloured labels in each panel now show the 
thresholds required to match the chosen abundance in each of the two simulations. They agree quite closely because the stellar mass functions of the two 
simulations almost coincide at the relevant masses. In contrast to Figs[5]and[4] the autocorrelation functions here agree very well between the two simulations. 



tion of satellites is larger in V max -liniited subhalo samples than in 
-M max -limited samples. At the resolution of the MS the cluster- 
ing predicted by either SHAM scheme is not well converged even 
for galaxies of similar mass to the Milky Way. Specifically, pre- 
dicting autocorrelation functions accurate to 20% or better requires 
M max > 1O 12 M or V m ax > 200 km/s. Note that this conclusion 
may depend on the specific alg orithm used to identify subhalos. 
The study of lOnionset al. (2012) suggests that relatively small dif- 
ferences are likely for the algorithms in frequent general use, and 
that SUBFIND is fairly typical. Our own study makes clear that 
SHAM should not be used for "precision" interpretation of clus- 
tering observations without careful convergence testing of the kind 
we present. 



In contrast , we find that clus tering of the semi-analytic galaxy 
populations of iGuo et al.l J20 1 lh converges remarkably well be- 
tween the two Millennium simulations. The treatment of "orphan 
galaxies" in these models does a surprisingly good job of remov- 
ing the resolution dependence of satellite disruption, the principal 
cause of discrepancie s between the SHA M catalogues. This was 
already recognised by Mos ter et al.l d201 3l) who augment ed the MS 
and M S-II subhalo catalogues with orphan galaxies from lGuo et al.l 
(201 l|) in their otherwise SHAM-based analysis of galaxy evolu- 
tion. In practice, however, SA models have a substantial advantage 
over SHAM schemes when interpreting large observational surveys 
of clustering not only because their treatment of orphans expands 
by almost two orders of magnitude the range of subhalo masses 
which can be robustly populated with galaxies, but also because 
their physically realistic treatment of galaxy evolution guarantees 
consistent assignment of galaxies to subhalos over the relatively 
broad range of redshifts spanned by the primary galaxy samples of 
most next-generation surveys. 
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